Speech Technology and Systems in Human
Abstract
Speech technology and systems in human-machine communication have advanced steadily and remarkably over the last two decades. Fundamental changes have taken place from theoretical foundations to practical systems, from laboratory prototypes to commercial products, and from proprietary software to industrial standards. As the information age continues, research in speech technology is further accelerated by the advent of powerful computing devices, data-driven pattern recognition methods, and the need to generate machine-understandable metadata for Web content and other information sources. Although various systems have been built and applied in numerous applications, the full potential of speech technology remains to be uncovered.
This special section addresses the need for a comprehensive review of new approaches and advances in speech technology from the broad perspective of intelligent human-machine communication. Speech technology and systems touch upon many essential signal processing techniques and are at the core of multimodal/multimedia communication research. We hope that such a systematic and up-to-date overview of the field can bring the awareness and applications of speech technology closer to the general signal processing community.
New research trends and directions in speech technology and human-machine communication systems have been evolving rapidly in recent years because of the changing business environment and advances in technology. With the Internet and the Web, an increasingly large amount of voice and speech data has become available. This, together with fast computing devices, has led to a new wave of advances in speech "document" understanding and multimedia/multimodal content search. New algorithms are being studied, many of which may not have been computationally feasible in earlier years. In addition, the large-scale deployment of voice over IP (VoIP) has revitalized research on noise-robust speech processing and recognition over the IP network, an environment very different from those of the past.
While scientific rigor remains the paramount selection criterion, we have attempted to provide balanced coverage of new research trends among the articles selected for this special section. About one year ago, we issued a call for papers on speech technology and human-machine communication. We received a large number of submissions, and we would like to thank all authors for their submissions. After a peer-review process, nine articles were selected that together provide a comprehensive overview of the landscape in speech technology and human-machine communication. A brief overview of the selected articles is provided below in the context of the general theme of this special issue: human-machine communication.
Publication date: 2009